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SEQUENCE LISTING 


(1) GENERAL INFORMATION: 

(i) APPLICANT: NI , JIAN 

ROSEN, CRAIG A. 
PAN, JAMES G. 
GENTZ, REINER L. 
DIXIT, VISHVA M. 

(ii) TITLEi OF INVENTION: Death Domain Containing Receptor-4 


\ 

3Ky 


(iii) NUMBER OF SEQUENCES: 11 


(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CIT^f: ROCKVILLE 

(D) STATE: MD 

(E) COUNTRY: US 

(F) ZIP: 20850 


\ 


(v) COMPUTER READABLE FORM: 

(A) MEDIUM VtYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 28-JAN-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BROOKES, ANDERS A 

( B ) REG I STRATI ON NUMBER : 3 6,373 

(C) REFERENCE /DOCKET NUMBER: PF3 55 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (30l\) 309-8504 

(B) TELEFAX: (301) 309-8512 


(2) INFORMATION FOR SEQ ID NO : l\ 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2152 base pairs 


(B) TYPE: nucleic acid \ 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


\ 

\ 
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<ix) FEATURE: 

XA) NAME /KEY: CDS 

(\b) LOCATION: 19 . . 1422 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTCGGGCACG AGGGGAGG ATG GCG CCA CCA CCA GCT AGA GTA CAT CTA GGT 51 

Met Ala Pro Pro Pro Ala Arg Val His Leu Gly 
15 10 

GCG TTC CTG GCA GTG ACT CCG AAT CCC GGG AGC GCA GCG AGT GGG AC A 99 
Ala Phe Leu Ala ValVrhr Pro Asn Pro Gly Ser Ala Ala Ser Gly Thr 
15 \ 20 25 

GAG GCA GCC GCG GCC ACA CCC AGC AAA GTG TGG GGC TCT TCC GCG GGG 147 
Glu Ala Ala Ala Ala ThA Pro Ser Lys Val Trp Gly Ser Ser Ala Gly 
30 \ 35 40 

\ 

AGG ATT GAA CCA CGA GGC GGG GGC CGA GGA GCG CTC CCT ACC TCC ATG 195 
Arg lie Glu Pro Arg Gly Gly. Gly Arg Gly Ala Leu Pro Thr Ser Met 
45 50\ 55 

GGA CAG CAC GGA CCC AGT GCC CGG GCC CGG GCA GGG CGC GCC CCA GGA 243 
Gly Gin His Gly Pro Ser Ala Arg Ala Arg Ala Gly Arg Ala Pro Gly 
60 65 \ 70 75 

CCC AGG CCG GCG CGG GAA GCC AGC CCT CGG CTC CGG GTC CAC AAG ACC 291 
Pro Arg Pro Ala Arg Glu Ala Ser Pro Arg Leu Arg Val His Lys Thr 

80 N^ 85 90 

TTC AAG TTT GTC GTC GTC GGG GTC CTG GTG CAG GTC GTA CCT AGC TCA 33 9 

Phe Lys Phe Val Val Val Gly Val Leu Leu Gin Val Val Pro Ser Ser 

95 100 \ 105 

GCT GCA ACC ATC AAA CTT CAT GAT CAA TCA VtT GGC ACA CAG CAA TGG 3 87 

Ala Ala Thr lie Lys Leu His Asp Gin Ser I\Le Gly Thr Gin Gin Trp 
110 115 \ 120 

GAA CAT AGC CCT TTG GGA GAG TTG TGT CCA CCa\gGA TCT CAT AGA TCA 43 5 

Glu His Ser Pro Leu Gly Glu Leu Cys Pro Pro Gly Ser His Arg Ser 
125 130 135 

GAA CGT CCT GGA GCC TGT AAC CGG TGC ACA GAG GG^GTG GGT TAC ACC 483 
Glu Arg Pro Gly Ala Cys Asn Arg Cys Thr Glu Gly Val Gly Tyr Thr 
140 145 150 \^ 155 

AAT GCT TCC AAC AAT TTG TTT GCT TGC CTC CCA TGT ACA GCT TGT AAA 531 
Asn Ala Ser Asn Asn Leu Phe Ala Cys Leu Pro Cys Thr\Ala Cys Lys 
160 165 \ 170 

TCA GAT GAA GAA GAG AGA AGT CCC TGC ACC ACG ACC AGG AAC ACA GCA 579 
Ser Asp Glu Glu Glu Arg Ser Pro Cys Thr Thr Thr Arg Asn\Thr Ala 
175 180 185 
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TGT CAG ^GC AAA CCA GGA ACT TTC CGG AAT GAC AAT TCT GCT GAG ATG 627 
Cys Gin C^s Lys Pro Gly Thr Phe Arg Asn Asp Asn Ser Ala Glu Met 
19 N 0 195 200 


lg\tgc 


TGC CGG AAG \TGC AGC ACA GGG TGC CCC AGA GGG ATG GTC AAG GTC AAG 67 5 

Cys Arg Lys Gys Ser Thr Gly Cys Pro Arg Gly Met Val Lys Val Lys 
205 \ 210 215 

GAT TGT ACG CCC\TGG AGT GAC ATC GAG TGT GTC CAC AAA GAA TCA GGC 72 3 

Asp Cys Thr Pro Vrp Ser Asp lie Glu Cys Val His Lys Glu Ser Gly 
220 \ 225 230 235 

AAT GGA CAT AAT ATA TGG GTG ATT TTG GTT GTG ACT TTG GTT GTT CCG 771 
Asn Gly His Asn IleVrrp Val lie Leu Val Val Thr Leu Val Val Pro 
240 \ 245 250 

TTG CTG TTG GTG GCT GTG CTG ATT GTC TGT TGT TGC ATC GGC TCA GGT 819 
Leu Leu Leu Val Ala VaPy Leu lie Val Cys Cys Cys lie Gly Ser Gly 
255 \ 260 265 

TGT GGA GGG GAC CCC AAG TG^T ATG GAC AGG GTG TGT TTC TGG CGC TTG 867 
Cys Gly Gly Asp Pro Lys Cys Met Asp Arg Val Cys Phe Trp Arg Leu 
270 \^275 280 

GGT CTC CTA CGA GGG CCT GGG GCT GAG GAC AAT GCT CAC AAC GAG ATT 915 
Gly Leu Leu Arg Gly Pro Gly A^a Glu Asp Asn Ala His Asn Glu lie 
285 290 \^ 295 

CTG AGC AAC GCA GAC TCG CTG TCC ACT TTC GTC TCT GAG CAG CAA ATG 9 63 

Leu Ser Asn Ala Asp Ser Leu Ser Tnr Phe Val Ser Glu Gin Gin Met 
300 305 \^ 310 315 

GAA AGC CAG GAG CCG GCA GAT TTG ACA GGT GTC ACT GTA CAG TCC CCA 1011 
Glu Ser Gin Glu Pro Ala Asp Leu Thr Gly Val Thr Val Gin Ser Pro 
320 325 330 


\ 


GGG GAG GCA CAG TGT CTG CTG GGA CCG GCA GAA GCT GAA GGG TCT CAG 1059 

Gly Glu Ala Gin Cys Leu Leu Gly Pro Ala Glu Ala Glu Gly Ser Gin 
335 340 \ 345 

AGG AGG AGG CTG CTG GTT CCA GCA AAT GGT GCT GAC CCC ACT GAG ACT 1107 

Arg Arg Arg Leu Leu Val Pro Ala Asn Gly Ala Asp Pro Thr Glu Thr 
350 355 \^360 

CTG ATG CTG TTC TTT GAC AAG TTT GCA AAC ATC GTG CCC TTT GAC TCC 1155 

Leu Met Leu Phe Phe Asp Lys Phe Ala Asn lie Val Pro Phe Asp Ser 
365 370 375 \^ 

TGG GAC CAG CTC ATG AGG CAG CTG GAC CTC ACG AAA AAT GAG ATC GAT 1203 

Trp Asp Gin Leu Met Arg Gin Leu Asp Leu Thr Lys Asn Glu lie Asp 

380 385 390 \ 395 


\ 


GTG GTC AGA GCT GGT ACA GCA GGC CCA GGG GAT GCC TTG TAT GGA ATG 12 51 

Val Val Arg Ala Gly Thr Ala Gly Pro Gly Asp Ala Leu Tyr Ala Met 
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400 405 410 

CTG ATgN^AA TGG GTC AAC AAA ACT GGA CGG AAC GCC TCG ATC CAC ACC 12 99 

Leu Met Lys Trp Val Asn Lys Thr Gly Arg Asn Ala Ser lie His Thr 
415 420 425 

CTG CTG GAT\GCC TTG GAG AGG ATG GAA GAG AGA CAT GCA AAA GAG AAG 1347 
Leu Leu Asp Ala Leu Glu Arg Met Glu Glu Arg His Ala Lys Glu Lys 
430 \ 435 440 

ATT CAG GAC CTC\TTG GTG GAC TCT GGA AAG TTC ATC TAC TTA GAA GAT 13 9 5 

lie Gin Asp Leu Leu Val Asp Ser Gly Lys Phe lie Tyr Leu Glu Asp 
445 \^ 450 455 

GGC ACA GGC TCT GCC^GTG TCC TTG GAG TGAAAGACTC TTTTTACCAG 1442 
Gly Thr Gly Ser Ala Val Ser Leu Glu 
460 ^65 

AGGTTTCCTC TTAGGTGTTA GGAGTTAATA CATATTAGGT TTTTTTTTTT TTTAACATGT 1502 
ATACAAAGTA AATTCTTAGC CACGTGTATT GGCTCCTGCC TGTAATCCCA TCACTTTGGG 1562 

\ 

AGGC TGACGC CGGTGGATCC ACTTGAGGTC CGAAGTTCCA AGACCAGCCC TGAACCAACA 1622 

\ 

TCGTGGAAAT GCCCGTCTTT TACAAAAAAA TAC C AAAAAT TCAACTGGAA TGTGCATGGT 16 82 

\ 

GTGTGCCATC ATTTCCTCGG CTAACTACGG GAGGTCTGAG GCCAGGAGAA TCCACTTGAA 1742 

CCCCACGAAG GACAGTGTAG ACTGCAGATT^GCACCACTGC ACTCCCAGCC TGGGAACACA 180.2 

GAGCAAGACT CTGTCTCAAG ATAAAATAAA ATAAACTTGA AAGAATTATT GCCCGACTGA 18 62 

GGCTCACATG CCAAAGGAAA ATCTGGTTCT CCC CTGAGCT GGCCTCCGTG TGTTTCCTTA 1922 

TCATGGTGGT CAATTGGAGG TGTTAATTTG AATGGATTAA GGAACACCTA GAACACTGGT 19 82 

AAGGCATTAT TTCTGGGACA TTATTTCTGG GCATGTCT^C GAGGGTGTTT CCAGAGGGGA 2 042 

TTGGCATGCG ATCGGGTGGA CTGAGTGGAA AAGACCTACG^CTTAATTTGG GGGGGCACCG 2102 

TCCGACAGAC TGGGGAGCAA GATAGAAGAA AACAAAAAAA AAAAAAAAAA 2152 


(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Pro Pro Pro Ala Arg Val His Leu Gly Ala Phe Leu Ala Val 


• # 
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5 10 15 

Thr Pro Asn\ Pro Gly Ser Ala Ala Ser Gly Thr Glu Ala Ala Ala Ala 
20 25 30 

Thr Pro Ser Lys Val Trp Gly Ser Ser Ala Gly Arg lie Glu Pro Arg 
35 \ 40 45 

Gly Gly Gly Arg\Gly Ala Leu Pro Thr Ser Met Gly Gin His Gly Pro 
50 \ 55 60 

Ser Ala Arg Ala Arg Ala Gly Arg Ala Pro Gly Pro Arg Pro Ala Arg 
65 \ 70 75 80 

Glu Ala Ser Pro Arg Leu Arg Val His Lys Thr Phe Lys Phe Val Val 
85 \ 90 95 

Val Gly Val Leu Leu Gin Val Val Pro Ser Ser Ala Ala Thr lie Lys 
100 \^ 105 110 

Leu His Asp Gin Ser lie Gly Thr Gin Gin Trp Glu His Ser Pro Leu 
115 \^ 120 125 

Gly Glu Leu Cys Pro Pro Gly; Ser His Arg Ser Glu Arg Pro Gly Ala 
130 135\ 140 

Cys Asn Arg Cys Thr Glu Gly Val Gly Tyr Thr Asn Ala Ser Asn Asn 
145 150 \ 155 160 

Leu Phe Ala Cys Leu Pro Cys Thr* Ala Cys Lys Ser Asp Glu Glu Glu 
165 \ 170 175 

Arg Ser Pro Cys Thr Thr Thr Arg Asn Thr Ala Cys Gin Cys Lys Pro 
180 185 190 

Gly Thr Phe Arg Asn Asp Asn Ser Ala Glu Met Cys Arg Lys Cys Ser 
195 200 . \^ 205 

Thr Gly Cys Pro Arg Gly Met Val Lys Val Lys Asp Cys Thr Pro Trp 
210 215 \ 220 

Ser Asp lie Glu Cys Val His Lys Glu Sen Gly Asn Gly His Asn lie 
225 230 \235 240 

Trp Val He Leu Val Val Thr Leu Val Val Pro Leu Leu Leu Val Ala 
245 250 \^ 255 

Val Leu He Val Cys Cys Cys He Gly Ser Glyi Cys Gly Gly Asp Pro 
260 265 \ 270 

Lys Cys Met Asp Arg Val Cys Phe Trp Arg Leu Gly Leu Leu Arg Gly 
275 280 \ 285 


Pro Gly Ala Glu Asp Asn Ala His Asn Glu lie Leu Ser Asn Ala Asp 
290 295 3( 


i00^ 
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Ser Leu Ser Thr Phe Val Ser Glu Gin Gin Met Glu Ser Gin Glu Pro 
305 \ 310 315 320 


Ala Asp Leu Thr Gly Val Thr Val Gin Ser Pro Gly Glu Ala Gin Cys 

325 330 335 

Leu Leu Gly ProUla Glu Ala Glu Gly Ser Gin Arg Arg Arg Leu Leu 

340 \ 345 350 

Val Pro Ala Asn Gly Ala Asp Pro Thr Glu Thr Leu Met Leu Phe Phe 

355 \ 360 365 


Asp Lys Phe Ala Asn lie Val Pro Phe Asp Ser Trp Asp Gin Leu Met 
370 \ 375 380 


Arg Gin Leu Asp Leu Thr Lys Asn Glu lie Asp Val Val Arg Ala Gly 
385 3^0 395 400 

Thr Ala Gly Pro Gly Asp Ala Leu Tyr Ala Met Leu Met Lys Trp Val 
405 \ 410 415 

Asn Lys Thr Gly Arg Asn Ala Ser lie His Thr Leu Leu Asp Ala Leu 
420 \ 425 430 

Glu Arg Met Glu Glu Arg Hrs Ala Lys Glu Lys lie Gin Asp Leu Leu 
435 \^440 445 

Val Asp Ser Gly Lys Phe lie Tyr Leu Glu Asp Gly Thr Gly Ser Ala 
450 455 \ 460 

Val Ser Leu Glu 
465 

(2) INFORMATION FOR SEQ ID NO ; 3 

(i) SEQUENCE CHARACTERISTICS K 

(A) LENGTH: 6 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Leu Gly lie Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala 
1 5 lo\ 15 

Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp lie Asn Ser 
20 25 \ 30 
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Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn 
\ 35 40 45 

Leu GluVsly Leu His His Asp Gly Gin Phe Cys His Lys Pro Cys Pro 
50 \^ 55 60 

Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro 
65 \y 70 75 80 

Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His 
\^ 85 90 95 

Phe Ser Ser Lys \Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly His Gly 

100 \ 105 110 

\ 

Leu Glu Val Glu Il^e Asn Cys Thr Arg Thr Gin Asn Thr Lys Cys Arg 
115 \ 120 125 

Cys Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp 
130 \^135 140 

Pro Cys Thr Lys Cys Glu His Gly lie lie Lys Glu Cys Thr Leu Thr 
145 150 \ 155 160 

Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp 
165 \ 170 175 

Leu Cys Leu Leu Leu Leu Pro lie Pro Leu lie Val Trp Val Lys Arg 
180 \ 185 190 

Lys Glu Val Gin Lys Thr Cys Arg Yys His Arg Lys Glu Asn Gin Gly 
195 200 \ 205 

Ser His Glu Ser Pro Thr Leu Asn PrdV Glu Thr Val Ala lie Asn Leu 
210 215 \ 220 

Ser Asp Val Asp Leu Ser Lys Tyr lie Tnr Thr lie Ala Gly Val Met 
225 230 \ 235 240 

Thr Leu Ser Gin Val Lys Gly Phe Val Arg Dys Asn Gly Val Asn Glu 
245 250 \ 255 

Ala Lys lie Asp Glu lie Lys Asn Asp Asn Val^Gln Asp Thr Ala Glu 
260 265 \ 270 

Gin Lys Val Gin Leu Leu Arg Asn Trp His Gin Leu, His Gly Lys Lys 
275 280 Y285 

\ 


Glu Ala Tyr Asp Thr Leu lie Lys Asp Leu Lys Lys Ala Asn Leu Cys 
290 295 300 


\ 


Thr Leu Ala Glu Lys lie Gin Thr lie lie Leu Lys Asp He Thr Ser 
305 310 315 \ 320 

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu He Gin Ser Led Val Met 
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325 


330 


335 


Leu Gly \le Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala Arg 
340 345 350 

Leu Ser Sen Lys Ser Val Asn Ala Gin Val Thr Asp lie Asn Ser Lys 
355 \ 360 365 

Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn Leu 
370 \^ 375 380 

Glu Gly Leu His Hivs Asp Gly Gin Phe Cys His Lys Pro Cys Pro Pro 
385 \ 390 395 400 

Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro Asp 
405 \ 410 415 

Cys Val Pro Cys Gin GluX51y Lys Glu Tyr Thr Asp Lys Ala His Phe 
420 \^ 425 430 

Ser Ser Lys Cys Arg Arg CysvArg Leu Cys Asp Glu Gly His Gly Leu 
435 \440 445 

Glu Val Glu lie Asn Cys Thr Ang Thr Gin Asn Thr Lys Cys Arg Cys 
450 455 \^ 460 

Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp Pro 
465 470 \ 475 480 


Cys Thr Lys Cys Glu His Gly lie IleVLys Glu Cys Thr Leu Thr Ser 
485 490 495 


\ 


Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp Leu 
500 505 \ 510 


Cys Leu Leu Leu Leu Pro He Pro Leu He Val Val Lys Arg Lys Glu 
515 520 \ 525 


Val Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly Ser His 

5^40 


530 


535 


Glu Ser Pro Thr Leu Asn Pro Glu Thr Val Ala Ile\ Asn Leu Ser Asp 

545 550 555 \ 560 

Val Asp Leu Ser Lys Tyr lie Thr Thr He Ala Gly Va\ Met Thr Leu 

565 570 \ 575 


Ser Gin Val Lys Gly Phe Val Arg Lys Asn Gly Val Asn Glu Ala Lys 

59 N 0 


580 


585 


v 


lie Asp Glu He Lys Asn Asp Asn Val Gin Asp Thr Ala Glu Gin Lys 
595 600 605 


Val Gin Leu Leu Arg Asn Trp His Gin Leu His Gly Lys Lys Glu\Ala 
610 615 620 
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Tyr Asp Vhr Leu lie Lys Asp Leu Lys Lys Ala Asn Leu Cys Thr Leu 
625 \ 630 635 640 

Ala Glu Lys lie Gin Thr lie lie Leu Lys Asp lie Thr Ser Asp Ser 
\ 645 650 655 

Glu Asn Ser Asn Phe Arg Asn Glu lie Gin Ser Leu Val 

66© 665 


(2) INFORMATION FOr\eQ ID NO : 4 : 

\ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES X S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Pro Leu Val Leu Leu Glu 
1 5 \^ 10 15 

Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro His 
20 25 30 

Leu Gly Asp Arg Glu Lys Arg Asp Se\ Val Cys Pro Gin Gly Lys Tyr 
35 40 \ 45 

lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys Gly 
50 55 \ 60 

Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Asp Thr Asp Cys Arg 
65 70 7 N 5 80 

Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu\ Asn His Leu Arg His 
85 90 \ 95 

Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val Glu lie 
100 105 \^ 110 

Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Gys Arg Lys Asn 
115 120 12,5 

Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys\Phe Asn Cys 
130 135 140 

Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu Lys Gin 
• 145 150 155 \^ 160 
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Asn Thr yal Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu 
165 170 175 

Cys Val Ser\ Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu 
\180 185 190 

Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser Gly Thr 
195 \ 200 205 

Thr Val Leu LeuWo Leu Val lie Phe Phe Gly Leu Cys Leu Leu Ser 
210 \ 215 220 

Leu Leu Phe lie Gly: Leu Met Tyr Arg Tyr Gin Arg Trp Lys Ser Lys 
225 \230 235 240 

Leu Tyr Ser lie Val Cys Gly Lys Ser Thr Pro Glu Lys Glu Gly Glu 
245 \ 250 255 

Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser Phe Ser 
260 \ 265 270 

Pro Thr Pro Gly Phe Thr ProVThr Leu Gly Phe Ser Pro Val Pro Ser 
275 ^280 285 

Ser Thr Phe Thr Ser Ser Ser Tnr Tyr Thr Pro Gly Asp Cys Pro Asn 
290 295 \^ 300 

Phe Ala Ala Pro Arg Arg Glu Val Ala Pro Pro Tyr Gin Gly Ala Asp 
305 310 \^ 315 320 

Pro lie Leu Ala Thr Ala Leu Ala Ser\Asp Pro lie Pro Asn Pro Leu 
325 \330 335 

\ 

Gin Lys Trp Glu Asp Ser Ala His Lys Pro Gin Ser Leu Asp Thr Asp 

340 345 \ 350 

\ 

\ 

Asp Pro Ala Thr Leu Tyr Ala Val Val Glu Asn Val Pro Pro Leu Arg 

355 360 \^ 365 

Trp Lys Glu Phe Val Arg Arg Leu Gly Leu Ser^Asp His Glu lie Asp 
370 375 "380 

\ 

Arg Leu Glu Leu Gin Asn Gly Arg Cys Leu Arg Glu Ala Gin Tyr Ser 
385 390 395 \ 400 


\ 


Met Leu Ala Thr Trp Arg Arg Arg Thr Pro Arg Arg Glu Ala Thr Leu 
405 410 \ 415 

Glu Leu Leu Gly Arg Val Leu Arg Asp Met Asp Leu Leu\Gly Cys Leu 
420 425 430 

Glu Asp lie Glu Glu Ala Leu Cys Gly Pro Ala Ala Leu Pro Pro Ala 
435 440 445 \ 

Pro Ser Leu Leu Arg Met Gly Leu Ser Thr Val Pro Asp Leu l)e^u Leu 


# 
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450 455 460 

Pro I^eu Val Leu Leu Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val 
465 \ 470 475 480 

lie Gly Leu Val Pro His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val 
485 490 495 

Cys Pro Gln\Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys 
\500 505 510 

\ 

Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro 
515 \ 520 525 

Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala 
530 \ 535 540 

\ 

Ser Glu Asn His Leu\ Arg His Cys Leu Ser Cys Ser Lys Cys Arg Glu 
545 X 550 555 560 

\ 

Lys Glu Met Gly Gin Va\L Glu lie Ser Ser Cys Thr Val Asp Arg Asp 
565 \ 570 575 


Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu 
580 \ 585 590 

\ 

Asn Leu Phe Gin Cys Phe Asn ^Cys Ser Leu Cys Leu Asn Gly Thr Val 
595 '600 605 

\ 

His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys His Ala 
610 615 \ 620 


Gly Phe Phe Leu Arg Glu Asn Glu Gys Val Ser Cys Ser Asn Cys Lys 
625 630 \ 635 640 

Lys Ser Leu Glu Cys Thr Lys Leu Cys\Leu Pro Gin lie Glu Asn Val 
645 ^650 655 


VaM 


Lys Gly Thr Glu Asp Ser Gly Thr Thr Va\L Leu Leu Pro Leu Val lie 
660 665 \^ 670 

Phe Phe Gly Leu Cys Leu Leu Ser Leu Leu Phe lie Gly Leu Met Tyr 
675 680 \ 685 

Arg Tyr Gin Arg Trp Lys Ser Asp Leu Tyr Ser\lle Val Cys Gly Lys 

690 695 7 V 00 


/\UU 


Ser Thr Pro Glu Lys Glu Gly Glu Leu Glu Gly Thr\ Thr Thr Lys Pro 
705 710 715 \ 720 

Leu Ala Pro Asn Pro Ser Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr 
725 730 \ 735 

Leu Gly Phe Ser Pro Val Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr 

740 745 7^0 

\ 
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Tyr Thr Pro Gly Asp Cys Pro Asn Phe Ala 
755. 760 


Ala Pro Arg Arg Glu Val 
765 


Ala Pro Pro Wr Gin Gly Ala Asp Pro lie 
770 \ 775 


Leu Ala Thr Ala Leu Ala 
780 


Ser Asp Pro lie Pro Asn Pro Leu Gin Lys 
785 \ 790 


\ 


Lys Pro Gin Ser Leu Asp Thr Asp Asp Pro 


8i05 


810 


\ 


Val Glu Asn Val Pro Pro Leu Arg Trp Lys 
820 \ 825 


Trp Glu Asp Ser Ala His 
795 800 

Ala Thr Leu Tyr Ala Val 
815 

Glu Phe Val Arg Arg Leu 
830 


Gly Leu Ser Pro His Glu lie Asp Arg Leu 
835 \ 840 


Glu Leu Gin Asn Gly Arg 
845 


Cys Leu Arg Glu Ala Gin Tyr Ser Met Leu 
850 \855 


Ala Thr Trp Arg Arg Arg 
860 


Thr Pro Arg Arg Glu Ala Thr Leu Glu Leu 
865 870 


Leu Gly Arg Val Leu Arg 
875 880 


Asp Met Asp Leu Leu Gly CysXLeu Glu Asp 
885 \ 890 


lie Glu Glu Ala Leu Cys 
895 


Gly Pro Ala Ala Leu Pro Pro Ala Pro Ser 
900 \ 905 


Leu Leu Arg 


(2) INFORMATION FOR SEQ ID NO : 5 : 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 


Met Glu Gin Arg Pro Arg Gly Cys Ala Ala Val Ala Ala Ala Leu Leu 
1 5 10 \ 15 


Leu Val Leu Leu Gly Ala Arg Ala Gin Gly Gly Thr Arg Ser Pro Arg 
20 25 


\ 30 
:ie Gly Leu^Phe 


Cys Asp Cys Ala Gly Asp Phe His Lys Lys lie Gly Leu^ Phe Cys Cys 
35 40 45 


59 


Arg GTs/ Cys Pro Ala Gly His Tyr Leu Lys Ala Pro Cys Thr Glu Pro 
50\ 55 60 

Cys Gly ^sn Ser Thr Cys Leu Val Cys Pro Gin Asp Thr Phe Leu Ala 
65 \ 70 75 80 

Trp Glu Asn His His Asn Ser Glu Cys Ala Arg Cys Gin Ala Cys Asp 
85 90 95 

Glu Gin Ala Ser\Gln Val Ala Leu Glu Asn Cys Ser Ala Val Ala Asp 
100 \ 105 110 


Thr Arg Cys Gly Cys Lys Pro Gly Trp Phe Val Glu Cys Gin Val Ser 
115 \ 120 125 


Gin Cys Val Ser Ser Ser Pro Phe Tyr Cys Gin Pro Cys Leu Asp Cys 
130 \ 135 140 


Gly Ala Leu His Arg His Thr Arg Leu Leu Cys Ser Arg Arg Asp Thr 
145 150 \ 155 160 


Asp Cys Gly Thr Cys Leu ProXSly Phe Tyr Glu His Gly Asp Gly Cys 
165 \ 170 175 


Val Ser Cys Pro Thr Ser Thr LeuvGly Ser Cys Pro Glu Arg Cys Ala 
180 \l.85 190 


Ala Val Cys Gly Trp Arg Gin Met Phe Trp Val Gin Val Leu Leu Ala 
195 200 \^ 205 

Gly Leu Val Val Pro Leu Leu Leu Gly Ala Thr Leu Thr Tyr Thr Tyr 

\ 220 

Arg His Cys Trp Pro His Lys Pro Leu Val Thr Ala Asp Glu Ala Gly 

235 


225 


230 


240 


Met Glu Ala Leu Thr Pro Pro Pro Ala Thr His ^Leu Ser Pro Leu Asp 
245 250 \ 255 


Ser Ala His Thr Leu Leu Ala Pro Pro Asp Ser Ser s Glu Lys lie Cys 

260 265 \^ 270 

Thr Val Gin Leu Val Gly Asn Ser Trp Thr Pro Gly Tyr Pro Glu Thr 

275 280 285^ 

Gin Glu Ala Leu Cys Pro Gin Val Thr Trp Ser Trp Asp Gin Leu Pro 

290 295 300 \^ 

Ser Arg Ala Leu Gly Pro Ala Ala Ala Pro Thr Leu Ser Pro Glu Ser 

305 310 315 \ 320 

Pro Ala Gly Ser Pro Ala Met Met Leu Gin Pro Gly Pro Gin LeuVyr 

325 330 335 


Asp Val Met Asp Ala Val Pro Ala Arg Arg Trp Lys Glu Phe Val ArgS 


60 


340 


345 


350 


Thr Ifeu Gly Leu Arg Glu Ala Glu lie Glu Ala Val Glu Val Glu lie 
355 360 365 

Gly Arg\Phe Arg Asp Gin Gin Tyr Glu Met Leu Lys Arg Trp Arg Gin 
370 \ 375 380 

Gin Gin Pro Ala Gly Leu Gly Ala Val Tyr Ala Ala Leu Glu Arg Met 

385 \ 390 395 400 


Gly Leu Asp Gly Cys Val Glu Asp Leu Arg Ser Arg Leu Gin Arg Gly 


\ 405 


410 


415 


Pro Met Glu GlnV Arg Pro Arg Gly Cys Ala Ala Val Ala Ala Ala Leu 
420 \ 425 430 


Leu Leu Val Leu Leu Gly Ala Arg Ala Gin Gly Gly Thr Arg Ser Pro 
435 \ 440 445 


Arg Cys Asp Cys Ala Gly Asp Phe His Lys Lys lie Gly Leu Phe Cys 


450 


\ 455 

\ 

\ 


460 


Cys Arg Gly Cys Pro Ala\ Gly His Tyr Leu Lys Ala Pro Cys Thr Glu 
465 470 \ 475 480 


Pro Cys Gly Asn Ser Thr Cys Leu Val Cys Pro Gin Asp Thr Phe Leu 

485 \^ 490 495 

Ala Trp Glu Asn His His Asn Ser Glu Cys Ala Arg Cys Gin Ala Cys 

500 \ 505 510 

Asp Glu Ala Ser Gin Val Ala Leu Glu Asn Cys Ser Ala Val Ala Asp 

515 520 \ 525 

Thr Arg Cys Gly Cys Lys Pro Gly Trp\ Phe Val Glu Cys Gin Val Ser 

530 535 \ 540 

Gin Cys Val Ser Ser Ser Pro Phe Tyr Cys Gin Pro Cys Leu Asp Cys 

545 550 XSS \ 5 

Gly Ala Leu His Arg His Thr Arg Leu Leu Cys Ser Arg Arg Asp Thr 

565 570 \^ 575 

Asp Cys Gly Thr Cys Leu Pro Gly Phe Tyr Glu His Gly Asp Gly Cys 

580 585 \^ 590 

Val Ser Cys Pro Thr Ser Thr Leu Gly Ser Cys Pro Glu Arg Cys Ala 

595 600 60.5 

Ala Val Cys Gly Trp Arg Gin Met Phe Trp Val Gin Val^Leu Leu Ala 

610 615 620 


Gly Leu Val Val Pro Leu Leu Leu Gly Ala Thr Leu Thr TyrVThr Tyr 
625 630 635 \ 640 


61 


lis cy 


Arg His Cys Trp Pro His Lys Pro Leu Val Thr Ala Asp Glu Ala Gly 


\ 


645 


650 


655 


Met Glu Ala Leu Thr Pro Pro Pro Ala Thr His Leu Ser Pro Leu Asp 
6'60 665 670 


Ser Ala His ThrVLeu Leu Ala Pro Pro Asp Ser Ser Glu Lys lie Cys 

675 \ 680 685 

Thr Val Gin Leu Val Gly Asn Ser Trp Thr Pro Gly Tyr Pro Glu Thr 
690 \ 695 700 


Gin Glu Ala Leu Cys Pr^o Gin Val Thr Trp Ser Trp Asp Gin Leu Pro 

705 ?10 \ 

Ser Arg Ala Leu Gly Pro Ala Ala Ala Pro Thr Leu Ser Pro Glu Ser 

725 \ 730 735 


Pro Ala Gly Ser Pro Ala Met \Met Leu Gin Pro Gly Pro Gin Leu Tyr 


740 


\ 


745 


750 


Asp Val Met Asp Ala Val Pro Ala Arg Arg Trp Lys Glu Phe Val Arg 
755 760\ 765 


Thr Leu Gly Leu Arg Glu Ala Glu Pie Glu Ala Val Glu Val Glu lie 
770 775 \ 780 


Gly Arg Phe Arg Asp Gin Gin Tyr Glu Met Leu Lys Arg Trp Arg Gin 
785 790 \ 795 800 


Gin Gin Pro Ala Gly Leu Gly Ala Val Tyr\ Ala Ala Leu Glu Arg Met 
805 810 \ 815 


Gly Leu Asp Gly Cys Val Glu Asp Leu Arg Ser Arg Leu Gin Arg Gly 
820 825 \ 830 


Pro 


\ 
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(2) INFORMATION FOR SEQ ID NO : 6 : 


(i) SEQUENCE CHARACTERISTICS: 

(A). LENGTH: 42 6 base pairs 
(BUTYPE: nucleic acid 

(C) \sTRANDEDNESS : single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GGCANAGGTN CGTACCTAGC TCACCTGCAA CCATCAAACT TNATGATCAA TCAATTGGCA 60 

CACAGCAATG GGAAACATAG^CCCTTTGGAA GANTTGTNTC CACCAGGATC TCATAGATCA 12 0 

AAACATCCTG GGAGCCTGTT ^ACCGGTGCC CCAAAGGNTG GTCAAGGTCA AGGAATTGTT 180 

NCGCCCTGGA AGTGAACATC GAGTGTNTCC ACAAAGGATT CAGGCAATGG GACATAAATA 240 

TATGGGTGAA TTTTGGTTGT GA^CTTTGGT TGNTCCCGTT GNTGTTGNTG GCTGTGCTGA 3 00 

TTGTTTGTTG TTGCATCGGC TTCAGGTTNT GGAGGGGGAC C C AAGTGC AT GGACAGGGTG 3 60 

TGTTTCTGGG GTTTGGGTCT CTTAG^GGGC NTGGGTTANG GCANGTTCAC AAGGGTTTTA 42 0 

GCAANG \^ 42 6 
(2) INFORMATION FOR SEQ ID NO : 7« 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 39 base pa\Lrs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 


c acid \ 
!: single\ 


(ii) MOLECULE TYPE: DNA (genomic') 

\ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

v 

TGGGGC TGAG GACAATGCTG ACNACGAGAT TCTGAGCAAC GCAGNACTNG CTGTCCACTT 60 

TCGTCTNTGN GCAGCAAATG GAAAGCCAGG AGCCGGCAGA TTTGACAGGT GTCACTGTAC 12 0 

AGTCCCCAGG GGAGGCACAG TGTCTGCTGG TGAGTTGGGG ACAGGCCCTT GCAAGACCTT 180 

\ 

GTGAGGCAGG GGGTGAAGGC CATGNCTCGG CTTCNNNTGG TCAAAGGGGA AGTGGAGCCT 240 

\ 

GAGGGAGATG GGACTTNAGG GGGACGGNGC TGCGTGGGGA ^^A?^AGCAGCC ACCNTTTGAC 3 00 


AAGGGGGACA GGCATTTTTN CAAATGTGTG CTTNTTGGT V 3 39 


\ 
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(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) T^fPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

\ 

(n) MOLECULE\ TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGGCATGCA TGATCAATCA ATTGGCAC 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: V 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : \ 1 inear 


(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GCGGGATCCG CCATCATGGC GCCACCACCA GCTAGA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acrd. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ^D NO: 10 
GCGGGATCCT CACTCCAAGG ACACGGCAGA GCG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l\ 


GCGGGATCCT CAATTATGTC CATTGCCTG 


